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Abstract 

This paper investigates the unknown input observability problem in the nonlinear case 
under the assumption that the unknown inputs are differentiable functions of time (up to 
a given order). The goal is not to design new observers but to provide simple analytic 
conditions in order to check the weak local observability of the state. The analysis starts 
by extending the observability rank condition. This is obtained by a state augmentation 
and is called the extended observability rank condition (first contribution). The proposed 
extension of the observability rank condition only provides sufficient conditions for the state 
observability. On the other hand, in the case of a single unknown input, the paper provides 
a simple algorithm to directly obtain the entire observable codistribution (second and main 
contribution). As in the standard case of only known inputs, the observable codistribution is 
obtained by recursively computing the Lie derivatives along the vector fields that characterize 
the dynamics. However, in correspondence of the unknown input, the corresponding vector 
field must be suitably rescaled. Additionally, the Lie derivatives must be computed also along 
a new set of vector fields that are obtained by recursively performing suitable Lie bracketing 
of the vector fields that define the dynamics. In practice, the entire observable codistribution 
is obtained by a very simple recursive algorithm. However, the analytic derivations required 
to prove that this codistribution fully characterizes the weak local observability of the state 
are complex. Finally, it is shown that the recursive algorithm converges in a finite number 
of steps and the criterion to establish that the convergence has been reached is provided. 
Also this proof is based on several tricky analytical steps. Several applications illustrate the 
derived theoretical results, both in the case of a single unknown input and in the case of 
multiple unknown inputs. 
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1 Introduction 


The problem of state observability for systems driven by unknown inputs (UI) is a fundamental 
problem in control theory. This problem was introduced and firstly investigated in the seventies 
paEamacs], a huge effort has then been devoted to design observers for both linear and 
nonlinear systems in presence of UI, e.g., m m Ei in na na nn nzi na nn na na mi- 

The goal of this paper is not to design new observers for systems driven by UI but to provide 
simple analytic conditions in order to check the weak local observability of the state. The obtained 
results hold for systems whose dynamics are nonlinear in the state and affine in both the known 
and the unknown inputs. Additionally, the unknown inputs are supposed to be smooth functions 
of time (specifically, they are supposed to be C k , for a suitable integer k). 

In [19] the observability properties of a nonlinear system are derived starting from the defi¬ 
nition of indistinguishable states. According to this definition, the Lie derivatives of any output 
computed along any direction allowed by the system dynamics take the same values at the states 
which are indistinguishable. Hence, if a given state x belongs to the indistinguishable set of a 
state Xq (i.e., to I XQ ) all the Lie derivatives computed at x and at Xq take the same values. This 
is a fundamental property. In particular, based on this property, the observability rank condition 
was introduced in m- 

Our first objective is to extend the observability rank condition. For, we introduce a new 
definition of indistinguishable states for the case UI (section [2} . Then, in section [3] we introduce 
a new system by a suitable state extension. For this extended system, we show that, the Lie 
derivatives of the outputs up to a given order, take the same values at the states which are 
indistinguishable. In other words, the new system satisfies the same property derived in m 
mentioned above and this allows us to extend the observability rank condition (section [4]). We 
will refer to this extension as to the Extended Observability Rank Condition ( EORC ). 

The new system is obtained by a state augmentation. In particular, the augmented state is 
obtained by including the unknown inputs together with their time-derivatives up to given order. 
This augmented state has already been considered in the past. Specifically, in |3| the authors 
adopted this augmented state to investigate the observability properties of a fundamental problem 
in the framework of mobile robotics (the bearing SLAM). In particular, starting from the idea 
of including the time-derivatives of the unknown input in the state, in [3] a sufficient condition 
for the state observability has been provided. 

The EORC is based on the computation of a codistribution defined in the augmented space. 
In other words, the EORC allows us to check the weak local observability of the original state 
together with its extension and not directly of the original state. This makes the computational 
cost dependent on the dimension of the augmented state. Additionally, the EORC only pro¬ 
vides sufficient conditions for the weak local observability of the original state since the state 
augmentation can be continued indefinitely. For these reasons, the paper focuses on the following 
two fundamental issues: 

• Understanding if it is possible to derive the weak local observability of the original state by 
computing a codistribution defined in the original space, namely a codistribution consisting 
of covectors with the same dimension of the original state. 

• Understanding if there exists a given augmented state such that, by further augmenting 
the state, the observability properties of the original state provided by EORC remain 
unvaried. 

Both these issues have been fully addressed in the case of a single unknown input (see theorems 

[l] and [2| . 
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Thanks to the result stated by theorem |T] (section [6| , the algorithm in definition [3] in section 
[fi] (for the case of a single known input) and in definition [fi] in section [8] (for the case of multiple 
known inputs) can be used to obtain the entire observable codistribution. In other words, the 
observability properties of the original state are obtained by a very simple algorithm. As it will 
be seen, the analytic derivations required to prove theorem [l] are complex and we are currently 
extending them to the multiple unknown inputs case. Theorem [2] (section [7j ensures the conver¬ 
gence of the algorithm in a finite number of steps and it also provides the criterion to establish 
that this convergence has been reached. Also this proof is based on several tricky and complex 
analytical steps. 

Both theorems [l] and [2] are first proved in the case of a single known input (sections [6] and [7]) 
but in section [8] their validity is extended to the case of multiple known inputs. 

All the theoretical results are illustrated in section [9] by deriving the observability properties 
of several nonlinear systems driven by unknown inputs. 


2 Basic Definitions 

In the sequel we will refer to a nonlinear control system with m u known inputs [u = [iq, • • • , u m ff\ T ) 
and m w unknown inputs or disturbances (w = [u>i, • • • ,w mw \ T ). The state is the vector x £ Ad, 
with M an open set of K". We assume that the dynamics are nonlinear with respect to the 
state and affine with respect to the inputs (both known and unknown). Finally, for the sake of 
simplicity, we will refer to the case of a single output y (the extension to multiple outputs is 
straightforward). Our system is characterized by the following equations: 

! m u m w 

X = fo(x) + fi(x)ui + 9 j(x)wj 

,=i j=i I 1 ) 

y = h(x) 

where fi(x), i = 0,1, • • • , m u , and gj(x ), j = 1, • • • , m w , are vector fields in Ad and the function 
h(x) is a scalar function defined on the open set Ad. For the sake of simplicity, we will assume 
that all these functions are analytic functions in Ad. 

Let us consider the time interval I = [0, Tj. Note that, since the equations in ([l]| do not 
depend explicitly on time, this can be considered as a general time interval of length T. In the 
sequel, we will assume that the solution of 0 exists in X and we will denote by x(t; x^; u ; w) 
the state at a given time t £ X, when cc(0) = xq and the known input and the disturbance are 
u(t) and w(t), respectively, Vf £ 1. 

We introduce the following definition: 

Definition 1 (Indistinguishable states in presence of UI) Two states x a and x^ are in¬ 
distinguishable if, for any u(t) (the known input vector function), there exist w a (t ) and Wb(t) 
(i.e., two unknown input vector functions in general, but not necessarily, different from each 
other) such that h(x(t ; x a ; u; w a )) = h(x(t ; Xb ; u ; Wb )) Vf £ T. 

This definition states that, if x a and Xb are indistinguishable, then, for any known input, by 
looking at the output during the time interval X, we cannot conclude if the initial state was 
x a and the disturbance w a or if the initial state was Xb and the disturbance u>b- We remark 
that, contrary to the definition of indistinguishable states in the case without disturbances, 
the new definition does not establish an equivalence relation. Indeed, we can have x a and Xb 
indistinguishable, Xb and x c indistinguishable but x a and x c are not indistinguishable. As in the 
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case of known inputs, given xq, the indistinguishable set I Xo is the set of all the states x such 
that x and xq are indistinguishable. Starting from this definition, we can use exactly the same 
definitions of observability and weak local observability adopted in the case without disturbances. 


3 Extended system and basic properties 

The results contained in this and in the next section are aslo available in |35] . 

In order to extend the observability rank condition to the case of unknown inputs we introduce 
a new system (the extended system) such that its Lie derivatives are constant on the indistin¬ 
guishable sets. The new system will be denoted by £^ fc ). It is simply obtained by extending the 
original state by including the unknown inputs together with their time derivatives. Specifically, 
we denote by k x the extended state that includes the time derivatives up to the (fc — 1)—order: 


,,(i) t 


,(fc-l) TiT 


( 2 ) 


where w^ = 4^ and k x £ M (k \ with M^ an open set of R n + fem ™. From (JTJ) it is immediate 
to obtain the dynamics for the extended state: 


where: 


fe i = /o Cfe) (* 


^)+E/i fc) ( a; K+E i r +(fe_1)m ” 


L n-\-km u 


+j (fc) 

W j 


3 =1 


(3) 


jfE 


x) = 


fo(x) + 

y;! 1 ) 

w ( 2 ) 

w^- 1 ) 




M x) 

0 km w 


(4) 


(5) 


and we denoted by 0 m the to— dimensional zero column vector and by l l m the to— dimensional 
unit column vector, with 1 in the I th position and 0 elsewhere. We remark that the resulting 
system has still m u known inputs and m w disturbances. However, while the m u known inputs 
coincide with the original ones, the m w unknown inputs are now the k —order time derivatives of 
the original disturbances. The state evolution depends on the known inputs via the vector fields 
f- k \ (i = 1, ,m u ) and it depends on the disturbances via the unit vectors 

(j = 1, • • • ,m w ). Finally, we remark that only the vector field depends on the new state 
elements. 

In the rest of this section we derive several properties satisfied by 


Lemma 1 Let us consider the system Y,( k \ The Lie derivatives of the output up to the m th 
order (m < k) are independent of w^\ j = 1, • • • ,m w , V/ > m. 

Proof: We proceed by induction on to for any k. When m = 0 we only have one zero-order 
Lie derivative (i.e., h(x)) : which only depends on x, namely it is independent of w^\ V/ > 0. 
Let us assume that the previous assert is true for to and let us prove that it holds for to + 1. If 
it is true for to, any Lie derivative up to the m th order is independent of w^\ for any / > to. 
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In other words, the analytical expression of any Lie derivative up to the m —order is represented 
by a function g(x, w, w (1) , ■ • • Hence, Vg = [|f, , g|fry,--- , 9u ,fln ,0( t - m ) m J. It 

is immediate to realize that the product of this gradient by any vector filed in (|3]) depends at 
most on u>( m \ i.e., it is independent of w^\ V/ > m + 1 ■ 

A simple consequence of this lemma are the following two properties: 


Proposition 1 Let us consider the system El fc ). The Lie derivatives of the output up to the k th 
order along at least one vector among (j = ^-i"' i m w) are identically zero. 

Proof: From the previous lemma it follows that all the Lie derivatives, up to the (fc—1)—order 
are independent of which are the last m w components of the extended state in (|2j) . Then, 

the proof follows from the fact that any vector among (j = l,--- ,m w ) has the 

first n + (k — 1)to^, components equal to zero ■ 


Proposition 2 The Lie derivatives of the output up to the k ttL order along any vector field /g , 
f[ k \ ■ ■ ■ , fml for the system coincide with the same Lie derivatives for the system E( fc+ b 

Proof: We proceed by induction on m for any k. When m = 0 we only have one zero-order 
Lie derivative (i.e., h(x)), which is obviously the same for the two systems, E^and E^ fc+1 l. Let 
us assume that the previous assert is true for m and let us prove that it holds for m+1 < k. If it is 
true for m , any Lie derivative up to the m th order is the same for the two systems. Additionally, 
from lemma 111 we know that these Lie derivatives are independent of w^\ V/ > m. The proof 
follows from the fact that the first n + mm w components of fl/' 1 , f[ k \ ■ • • , fml coincide with the 
first n + mm w components of /g fc+1) , f[ k+1 \ ■ ■ • , fm+ 1 ' > when m < k ■ 

In the sequel we will use the notation: £ = [w T , w ^ T , • • • , w^ k ~^ T ) T . In this notation 
we have k x = [x T , £ T ] T . We also denote by E^the original system, i.e., the one characterized 
by the state x and the equations in (jT|) . 

It holds the following fundamental result: 

Proposition 3 If x a and Xb are indistinguishable, there exist £ a and fb such that, in El fc q the 
Lie derivatives of the output up to the k th -order, along all the vector fields that characterize the 
dynamics of E^ fc l, take the same values at [x a ,£ a ] an d [xb,fb\- 

Proof: We consider a piecewise-constant input u as follows (i = 1, • • • ,m u ): 

Ui(t) = (6) 


{ u\ t e [o, t{) 
u~ t G \t \, ti + tf) 

U® t £ [tl + t2 + • • • + tg-l, tl + f2 + • • ■ + tg -1 + tg) 

Since x a and Xb are indistinguishable, there exist two disturbance functions w a (t) and Wb(t) such 
that the output coincide on x a and Xb■ In particular, we can write: 

h{x(t; [x a ,£,a\\ u; W {k) )) = h(x(t; u; w { b k) )) (7) 
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\/t £ [0, ti + t 2 H-+ t g -1 + f g ) C X. On the other hand, by taking the two quantities in (|7| at 

t = ti +t% + • • • + t g -1 + t g , we can consider them as functions of the g arguments t\, t 2 , • • ■ , t g . 
Hence, by differentiating with respect to all these variables, we also have: 

d 9 h{x{ti H-b t g -, [z; 0 ,£ 0 ]; u; wi fc) )) _ 

dhdt 2 ---dt.g ~ 1 j 

_ d 9 h(x(t 1 H-b t g -, [a:;,, £&]; ft; w ( b k) )) 

dt\dt 2 • • • dt g 

By computing the previous derivatives at ti = t 2 = • • • = t g = 0 and by using proposition [lj we 
obtain, if g < k: 


r 9 h 

ts p p p I u 


— r 9 h 

X = X a VlV2---Vg 


X = Xb 


(9) 




£ = & 


where 0 h = ft 
choices of 


(fc) 


YJi=\ fi k)u i , h = 1 , • • • , g. The equality in 
i .^ . By appropriately selecting these u\, - ■ ■ ,\ 


must hold for all possible 
, we finally obtain: 


C 9 

Dl«2-l ’g 


h 


x = x c 


= C? h 


X = Xb 




£ = 6 


( 10 ) 


where viv 2 ■••v g are vector fields belonging to the set {fo k \f[ k \ - • • , fml} ■ 

In {T9] it was also defined the concept of V— indistinguishable states, with V a subset of the 
definition set that includes the two considered states. From this definition and the previous 
proof we can alleviate the assumptions in the previous proposition. Specifically, we have the 
following: 


Remark 1 The statement of proposition^ also holds if x a and Xb are V—indistinguishable. 

Thanks to the results stated by propositions [2] and [3] we will introduce the extension of the 
observability rank condition in the next section. 


4 Extension of the Observability Rank condition 

According to the observability rank condition, the weak local observability of the system in ([!]) 
with m w = 0 at a given point xq can be investigated by analyzing the codistribution generated 
by the gradients of the Lie derivatives of its output. Specifically, if the dimension of this codis¬ 
tribution is equal to the dimension of the state on a given neighbourhood of Xq, we conclude 
that the state is weakly locally observable at x$ (theorem 3.1 in HU). We can also check the 
weak local observability of a subset of the state components. Specifically, a given component of 
the state is weakly locally observable at Xq, if its gradient belongs to the aforementioned codis- 
tributiorQ The proof of theorem 3.1 in pT9j is based on the fact that all the Lie derivatives (up 
to any order) of the output computed along any direction allowed by the system dynamics take 
the same values at the states which are indistinguishable. 

1 A component of the state is observable at xq if it is constant on the indistinguishable set I xo . 
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Let us consider now the general case, i.e., when m w ^ 0. In the extended system (£ 
we know that the Lie derivatives up to the A;—order satisfy the same property (see proposition 
[3]). Therefore, we can extend the validity of theorem 3.1 in [T9] to our case, provided that we 
suitably augment the state and that we only include the Lie derivatives up to the k —order to 
build the observable codistribution. 

In the sequel, we will introduce the following notation: 

• Cl m will denote the observable codistribution for £( fc i that includes all the Lie derivatives 
of the output along f^ k \ f[ k \- ■ ■ , fm'i up to the order m < fc; 

• The symbol d will denote the gradient with respect to the extended state in <© and the 
symbol d x will denote the gradient only respect to x; 

• For a given codistribution A and a given vector field 77 , we will denote by C n A the codis¬ 
tribution whose covectors are the Lie derivatives along 77 of the covectors in A (we are 
obviously assuming that the dimension of these covectors coincides with the dimension of 
»?)• 

• Given two vector spaces V\ and Vi, we will denote with V\ + Vi their sum, i.e., the span of 
all the generators of both V\ and Vi. 

• For a given V C M i ' k '> and a given [x 0 , £ 0 ] £ V, we will denote with I^ xq ^ the set of all 
the states V —indistinguishable from [xo, £o]- 

The codistribution Sl m can be computed recursively by the following algorithm: 

Algorithm 1 (Computation of Cl m , m < k) 

Set Clo = span{dh}; 

Set i = 0 
while i < to do 

Set i = i + 1 

Set Cli = fb-i + Ei /= 0 1 

end while 

Let us denote by X j the j th component of the state (j = 1, ■ ■ • ,n). We introduce the following 
definition: 

Definition 2 ( EORC ) For the system the j th component of the state (i.e., Xj, j = 

1 , ,n) satisfies the extended observability rank condition at [cco,£o] if dxj £ Clk at [xo,£o]- 
If this holds Wj = 1, • • • , n, we say that the state x satisfies the extended observability rank 
condition at [xo,£o] 

We have the following result, which is the extension of the result stated by theorem 3.1 in m- 

Proposition 4 For if Xj (j = 1, • • • n) satisfies the observability rank condition at [xo,£oL 
then Xj is weakly locally observable at [xo,£o]- Additionally, Xj remains weakly locally observable 
by further extending the state (i.e., in every system (f > k)). 

Proof: We prove that it exists an open neighbourhood U of [xo,£o] such that, for every open 
neighbourhood V C U of [xo,£o]> x j is constant on the set ^ o j. Since dxj £ f Ik at [xo,^o]> it 
exists some open neighborhood U of [xo, Co], such that Xj can be expressed in terms of the Lie 
derivatives of h along the directions f-)' 1 (f = 0,1, ,m u ) up to the k order. If V C U is an 


open neighborhood of [a;o,£o]> then proposition [3] and remark |T] imply that all the Lie derivatives 
up to the k order are constant on the set iV ^ and, consequently, also x 3 is constant on this 
set. Finally, the fact that x :l is weakly locally observable in every system (/ > k) directly 
follows from proposition [2] ■ 

In accordance with the previous result, the EORC is a tool to analyze the observability 
properties of a nonlinear system driven by known and unknown inputs. However, we remark two 
important limitations of the EORC. The former consists in the fact that the state augmentation 
can be continued indefinitely. As a result, the EORC only provides sufficient conditions for 
the weak local observability of the state components. The latter regards the computational cost 
demanded to check if it is satisfied. Specifically, the computation demanded to check if dxj 
belongs to can be very complex because by increasing k we also increase the dimension of 
the extended state. 

In the rest of this paper, we will focus our attention on these fundamental issues and we will 
provide the main paper contributions: 

• obtaining a new codistibution (Qk) that is the span of covectors whose dimension is n (i.e., 
independent of the state extension) such that d x Xj £ f Ik if and only if dxj £ f2*,; 

• understanding if there exists a given k such that, if dxj ^ O k , then dxj ^ Clk Vfc > k. 

We fully address both these issues in the case m w = 1. In section [5] we introduce the basic 
equations that characterize this case. In section [6] we provide a complete answer to the first 
issue by operating a separation on the codistribution generated by all the Lie derivatives up 
to the k— order. Specifically, we prove that the observable codistribution can be splitted into 
two codistributions. The former is generated by the gradients of scalar functions that only 
depend on the original state. The latter is generated by the gradients of scalar functions that 
depend on the entire augmented state. However, this latter codistribution can be ignored when 
deriving the observability properties of the original state. The former codistribution, namely 
the one generated by the gradients of scalar functions that only depend on the original state, is 
defined by a simple recursive algorithm. In section [7] we provide a complete answer to the second 
issue by proving that this algorithm converges in a finite number of steps and by also providing 
the criterion to establish that the convergence of the algorithm has been reached (theorem [2]) . 
Also this proof is based on several tricky analytic steps. For the sake of clarity, we start this 
discussion by considering the case when the system is characterized by a single known input, 
i.e., when m u = 1 (sections [5] [b] and 7|. In particular, both theorems [l] and [2] are proved in this 
simplified case. However, in section 8] their validity is extended to the case of multiple known 
inputs (i.e., Vm u > 1). 


5 Single known Input and single disturbance 

We will refer to the following system: 


x = f(x)u + g{x)w 
V = h{x) 


( 11 ) 


In other words, we consider the case when /o is the null vector and m u = m w = 1. In this case, 
the extended state that includes the time derivatives up to the (k — 1)—order is: 


k x= [x T , w, w (1) , 


w 


(k- 


1J ] 5 


( 12 ) 
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The dimension of the extended state is n + k. From (111 it is immediate to obtain the dynamics 
for the extended state: 


x ■ 


where: 


-G( k x) + F(x)u + i:l k k w^ 


(13) 


F = 


’ fix) ' 


g{x)w 

0 


wW 

0 

G = 

w (2) 

0 



0 


0 


(14) 


In the sequel, we will denote by L g the first order Lie derivative of the function h{x) along the 
vector held g(x), i.e., L g = L g h. The derivations provided in the next sections are based on the 
assumption that L g ^0 on a given neighbourhood of xq. We conclude this section by showing 
that, when this assumption does not hold, it is possible to introduce new coordinates and to show 
that the observability properties can be investigated starting from a new output that satisfies 
the assumption. 

Let us suppose that L g = 0 on a given neighbourhood of xq. We introduce the following 


system associated with the system in (111: 


x = f(x) + g{x)u 
V = h{x) 


(15) 


This is a system without disturbances and with a single known input u. Let us denote by r the 
relative degree of this system at xq. Since L g = 0 on a given neighbourhood of xo, we have r > 1. 
Additionally, we can introduce the following new local coordinates (see proposition 4.1.3 in my 


x' = Q(x) = 

such that the first new r coordinates are: 


Qi(x) 

Qn(x) 


(16) 


Qi(x) = h(x), Q 2 (x) = £fh(x), Q r (x) = C r f 1 h(x) (17) 

Now let us derive the equations of the original system (i.e., the one in pT| ) in these new coor¬ 
dinates. We have: 


{ x! = f{x’)u + g(x')w 
V = x 'i 

where / and g have the following structure: 


x' 2 


0 

*3 


0 

x' r 

9 = 

0 

fr(x') 


9r( x> ) 

. fn{x') _ 


_ 9n{x') _ 


(18) 


(19) 
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We remark that the first r components of x' are weakly locally observable since they are the 
output and its Lie derivatives along / up to the (r — 1)—order (note that we do not need to 
augment the state to use the first (r — 1) Lie derivatives because all the Lie derivatives up to 
the (r — 1)—order that includes at least one direction along g vanish automatically). In order to 
inv esti gate the observability properties of the remaining components, we augment the state as 
in (12 1 and we can consider the new output h{x') = x' r . We set L l g = g r = L g C^ l h ^ 0. 


6 The observable codistribution (Q) 

In this section we operate a separation on the codistribution generated by all the Lie derivatives 
up to the m—order (to < k). Specifically, we prove that this codistribution can be splitted 
into two codistributions. The former is generated by the gradients of scalar functions that only 
depend on the original state. The latter is generated by the gradients of scalar functions that 
depend on the entire augmented state. However, this latter codistribution can be ignored when 
deriving the observability properties of the original state. 

The observable codistribution is given by the algorithm [l] and, in this case m u = m w = 1, it 
reduces to the span of the gradients of all the Lie derivatives along F and G up to the fc-order. 
Hence, for any to < k, it is obtained recursively by the following algorithm: 

1. H 0 = span{dh}; 

2. fl m = flm —1 T F'G^m — 1 T 1 

For a given m < k we define the vector (f) m £ R” by the following algorithm: 

i- <l>o = /; 

2 . cf m = 31 

where the parenthesis [•, •] denote the Lie brackets of vector fields. Similarly, we define <h m £ R n+k 
by the following algorithm: 

1. $ 0 = F; 

2. <I> m = G] 

By a direct computation it is easy to realize that <f> m has the last k components identically null. 
In the sequel, we will denote by 5> m the vector in R” that contains the first n components of 

$ m . In other words, >I> m = [l>^, 0^] T . Additionally, we set 0 m = 

We define the 11 codistribution as follows (see definition [5] in section [ 8 ] for the case when 
m u > 1): 

Definition 3 (f l codistribution, m u = m w = 1) This codistribution is defined recursively by 
the following algorithm: 

1. Hq = d x h; 

= fl m _i T T -(- Ta, -,d x h 

JX 
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Note that this codistribution is completely integrable by construction. More importantly, its 
generators are the gradients of functions that only depend on the original state ( x ) and not on 
its extension. In the sequel, we need to embed this codistribution in R n+fc . We will denote by 
[n m ,0 fc ] the codistribution made by covectors whose first n components are covectors in £l m 
and the last components are all zero. Additionally, we will denote by L m the codistribution 
that is the span of the Lie derivatives of dh up to the order m along the vector G, i.e., L m = 
span{C G dh , C G dh, • • • , C G dh}. 

We finally introduce the following codistribution: 

Definition 4 (II codistribution) This codistribution is defined as follows: f l m = [f2 m ,0 k]+L m 

The codistribution (l rn consists of two parts. Specifically, we can select a basis that con¬ 
sists of exact differentials that are the gradients of functions that only depend on the origi¬ 
nal state (x) and not on its extension (these are the generators of [fl m ,0fc]) and the gradients 
C G dh, C G dh, ■ ■ ■ , C G dh. The second set of generators, i.e., the gradients C G dh, C G dh, • • • , C G dh, 
are m and, with respect to the first set, they are gradients of functions that also depend on the 
state extension £ = [w, w W,--- , u/ m-1 )] T . 

We have the following result: 

Lemma 2 Let us denote with Xj the j th component of the state (j = 1, • • • ,n). We have: 
d x Xj £ f i m if and only if dxj € Ct m 

Proof: The fact that d x Xj £ O m implies that dxj £ fl m is obvious since [fl m ,0fc] C Q m by 
definition. Let us prove that also the contrary holds, i.e., that if dxj £ fl m then d x Xj £ fl m . 
Since dxj £ tl m we have dxj = c \ c i where ■ ■ ■ , are N\ generators 

of [fl m , 0 fc ] and tof, w|, • • • ,w^ 2 are N 2 generators of L m . We want to prove that N 2 = 0. 

We proceed by contradiction. Let us suppose that N 2 > 1. We remark that the first set of 
generators have the last k entries equal to zero, as for dxj. The second set of generators consists 
of the Lie derivatives of dh along G up to the m order. Let us select the one that is the highest 
order Lie derivative and let us denote by j' this highest order. We have 1 < N 2 < j' < m. By 
a direct computation, it is immediate to realize that this is the only generator that depends on 
Specifically, the dependence is linear by the product _1 ) (we remind the reader 

that Lg ^ 0). But this means that dxj has the (n + j') th entry equal to Lg ^ 0 and this is not 
possible since dxj = [d x Xj,0k] ■ 

A fundamental consequence of this lemma is that, if we are able to prove that Ll m = Q m , the 
weak local observability of the original state x, can be investigated by only considering the 
codistribution fl m . In the rest of this section we will prove this fundamental theorem, stating 
that — fl^. 

Theorem 1 (Separation) f t m = = [fi m ,0fc] + L m 

The proof of this theorem is complex and is based on several results that we prove before. Based 
on them, we provide the proof of the theorem at the end of this section. 

Lemma 3 Cc& m + C$ m dh. = + CpC G dh 

Proof: We have C F C%dh = L G L F CJfr\lh + C^ >1 C G ~ 1 dh. 

The first term C G C F L G ~ 1 dh £ £ G fl m . Hence, we need to prove that C G fl m + C^> m dh = 
C G n m + C$ 1 C G ~ l dh. We repeat the previous procedure m times. Specifically, we use the 
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equality £<j, ( CJ G ■'dh = CgC^^C'c ■* l dh + C^, j+1 C G J 1 dh, for j = l,--- ,m, and we remove 
the first term since CgC,^-C^~^~ X dh € Cc&m ■ 


Lemma 4 <F m = XlJLi c j ifiGh, C G h, ■ ■ ■ ,C G h)(l)j, i.e., the vector is a linear combination 
of the vectors <pj (j = 1 , • • • , m), where the coefficients (cf) depend on the state only through the 
functions that generate the codistribution L m 


Proof: We proceed by induction. By a direct computation it is immediate to obtain: 

4>i = (fiCch. 

Inductive step: Let us assume that 4> m _i = YlT=i c j(£oh, C G h,- ■ • , CQ~ 1 h)<t>j. We have: 


m— 1 r 


= [$ m _ 1 , G) = 

i=i 


Ofc 


G 


m— 1 


E 


V] 

Ofc 


G 


m —1 


- E Cgc 


i=i 


vo 

Ofc 


We directly compute the Lie bracket in the sum (note that (fj is independent of the unknown 
input w and its time derivatives): 


VO 

Ofc 


, G 


[<An 9}i 

Ofc 


Ofc 


Regarding the second term, we remark that Cg c o = 5H"=i' By setting Cj = 

Cj-\C} G h for j = 2 , ••• ,to and cq = 0 , and by setting Cj = — Y^r=i h) f° r J = 

l,--- ,m — 1 and c m = 0 , we obtain <I> m = (c? + Cj)(j>j , which proves our assert since 

c"(= c ; y + Cj) is a function of Cch, C G h, ■ ■ ■ , C G h ■ 

It also holds the following result: 


Lemma 5 <f> m = (j^-ch, C^h, ■ ■ ■ ,C G h)$j, i.e., i/ie vector is a linear combination 

of the vectors (y = 1, • • • , ?n,), where the coefficients (bf) depend on the state only through the 
functions that generate the codistribution L m 

Proof: We proceed by induction. By a direct computation it is immediate to obtain: <f> \ = 

$17^. 

1 V-Gh 

Inductive step: Let us assume that (f m -i = bj(Cch, C G h, ■ ■ ■ ,C G ~ 1 h)^j. We need to 

prove that (f m = J2JLi bf(Cch, £ G h, ■ ■ ■ , We start by applying on both members of 

the equality $ m -i = J2p=i 1 bj(jC.Gh,jC. G h, - ■ ■ , Cfi~ 1 h)^j the Lie bracket with respect to G. We 
obtain for the first member: [(j> m - i, G\ = <p m C G h. For the second member we have: 


m —1 m— 1 m —1 

E G] = E G ] - E tGbj*, = 


j =1 


i =i 


j=i 


m—1 m—lm—1 

= E • i -EE 

j —i i=i *=i 


d(C G h) 


Zg'hQj 
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By setting bj = for j = 2, • • ■ , m and b\ = 0, and by setting bj = — d(C* h^C 1 ^ ^ or 

j = 1, ■ • ■ , m — 1 and = 0, we obtain <f m = YJLi( bj + which proves our assert since 

bj(= bj + bj) is a function of Cah, C G h, • • • , Cffh ■ 

An important consequence of the previous two lemmas is the following result: 

Proposition 5 The following two codistributions coincide: 

1. span{C$> 0 dh , C&^h, • • • , £$ m dh, C G dh, ■ ■ ■ C G dh}; 

2. spanfC^dh, C^dh,- ■ ■ ,£^ dh,C G dh,-■ ■ Cfidh}; 

We are now ready to prove theorem [T] 

Proof: We proceed by induction. By definition, Ho = Ho since they are both the span of dh. 
Inductive step: Let us assume that f2 m _i = We have: f \ m = f2 m _i + Cp&m-i + 

^G^m —1 = iim -1 + ^F^m -1 + ^G^m -1 = 1 + Ofc] + ™ 1 + £-G&m- 1 - On the 

other hand, CfL m_1 = Cptf-cdh + • • • + Cp£Q~ 2 dh + CfCq 1 dh. The first in —2 terms are in 
n m _i. Hence we have: f2 m = H m _i + [£/S2 m _i,0fc] + CF^ G ~ 1 dh + Cc^m- 1 - By using lemma 
[ 3 ] we obtain: f2 m = fj TO _i + [Cfd m _i, 0*,] + £<& m l dh + By using again the induction 

assumption we obtain: = [fl m _i,0fc] +L m_1 + [£/fi m _i,0fc] + C® m _ 1 dh + £ G [fl m _i, 0&] + 

CqL m_1 = [H m _i,0fc]+L m +[£/fl m _i,0fc]+£$ m _ 1 ci/i+[£ a and by using proposition 

0 L? 

we obtain: = [H m _i,0 fc ] + L m + [£/f2 m _i,0 fe ] + C^_ i dh+ 0 fc ] = fi m ■ 

7 Convergence of the algorithm that defines $1 

Theorem [l] is fundamental. It allows us to obtain all the observability properties of the original 
state by restricting the computation to the 11 codistribution, namely a codistribution whose 
covectors have the same dimension of the original space. In other words, the dimension of these 
covectors is independent of the state augmentation. The fl codistribution is defined recursively 
and fl m C fl m+ i (see definition [3] in section [6]). This means that, if for a given m the gradients 
of the components of the original state belong to fl m , we can conclude that the original state 
is weakly locally observable. On the other hand, if this is not true, we cannot exclude that it 
is true for a larger m. The goal of this section is precisely to address this issue. We will show 
that the algorithm converges in a finite number of steps and we will also provide the criterion to 
establish that the algorithm has converged (theorem |2|. This theorem will be proved at the end 
of this section since we need to introduce several important new quantities and properties. 

For a given positive integer j we define the vector ifj £ R™ by the following algorithm: 

1- V'o = /; 

2. ifj = [^_i, {t} 

It is possible to find a useful expression that relates these vectors to the vectors <fij, previously 
defined. Specifically we have: 


Lemma 6 It holds the following equation: 


tfj 



V '£V 1 



L l 


( 20 ) 
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Proof: We proceed by induction. By definition tpo = 
3=0. 

Inductive step: Let us assume that it holds for a given j 
for j. We have: 


= / and equation (201 holds for 


— 1 > 0 and let us prove its validity 


= 


V’j-i, 


L l 


, 9 

Pj ~ u Ll 


9 J 


( £<t>i Ll g 1 l 9 9 


On the other hand: 


and 


, 5 

h- 1 ' Ll 


9-1 


L l L l 


0-2 




. 2=0 


— 2—2 ( ^i L g \ 1 _3_ _3_ 

l L l )\LY L l 


0-2 




„ 2=0 


—z—2 f £<pjLl \ 1 


r j —2 




/ /• T 1 
i — 2 /*.7 2 1 / 


Hence: 


0.7 = 0.7 ^ 


. 2=0 


C 6i _,L\ 


L l L l 


0-2 


^ U=0 




r , t l 

—2 /».? —7—l / a 


L l \ L Y 


which coincides with (201 


We have the following result: 

Lemma 7 For i = 0,1, • • • , mn — 2, we have: 

d 


U,L\ 


L l 




( 21 ) 


Proof: By construction, dxC^h £ for any i = 1, • • • , m — 1. On the other hand, we 
have: 


1 


C&; 1 Ll 
- 1 9 - 


£<t>ih — l [C^ i _ 1 C g h C g C c j >i _ 1 h] — , 1 

L g L g 

We compute the gradient of both members of this equation. Since d x L_a_L^ >i _ l h £ Ll m , for 

C, Z/ 1 

any i = 1, ■ • • , m — 1, also d x "‘ L l - £ Ll m ■ 

From lemma[6]with j = 1, • ■ • , m — 1 and lemma [7] it is immediate to obtain the following result: 

Proposition 6 If Ll m is invariant with respect to Cf and £_s_ then it is also invariant with 
respect to /E., j = 1 , • ■ • , m — 1 . 
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Let us denote by L 2 = £ 2 h and by p = -pfjyi- 
Lemma 8 We have the following key equality: 


£<t>j h — ^03-2 P “I” P 


r f -2 L], 

-Lr- c ^[-^r +c ^ h / 


3> 2- 

Proof: We will prove this equality by an explicit computation. We have: 

= yy (■^'03-1 ~ ^g^<t>j- i^ 1 ) 

L g 


( 22 ) 


The second term on the right hand side simplifies with the last term in (221. Hence we have 
to prove: 


We have: 


We remark that: 


J r rl _ r „ i „ -^j-2-kg r ^03-2^g 


jjl^i-^g (L 1 ) 2 •^'9-^03-2-^g) 


and 


£<bj-2 L g 


(L 1 ) 2 £’<t‘j-2p + ^P JJL 


(23) 

(24) 


r rl r rl 

r r rl_ g , r '~<t>j-2 lj g 

LgL^ 2 Lq - P -71-I- L_- 


( L 1 ) 2 5 ^- 2 9 


L l 


L l 


By substituting these two last equalities in (241 we immediately obtain (231 


Lemma 9 In general, it exists a finite to such that d x p G fi m . 

Proof: For a given to, H m contains all the covectors d x C r j )j h (j = 0. ■ • • , to— 1). From equation 
(22), we immediately obtain that, for a given m > 3, fl m contains the covectors (j = 0, • • • , to—3): 

(25) 


Pj — dx£(j)j+2h d x Pj + XjdxP “i - pdxXj (d x Xj dx£(j)j+ih') 

(L L ^ 

with d x pj = dxCcfrjP and \j = £1 " ■ 

By using lemma [7] we obtain the following results: 


• pdxXj € j = 0, • • ■ , to. - 2; 

• Cj^dxXj G j = 0, • • • , to - 3; 
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Additionally, also C a dxC^^h £ £t m , j = 0, • • • ,m — 3. Hence, from (251, we obtain that 
Q m also contains the covectors (j = 0, • ■ • , m — 3): 


/i' = d x pj + XjdxP 

Let us denote by j* the smallest integer such that: 


(26) 


f-i 

d x Pj* — ^ ) Cjd x Pj T c_id x h 

3=0 


(27) 


Note that j* is a finite integer and in particular j* < n — 1, where n is the dimension of the 
state. Indeed, if this would be not the case, the dimension of the codistribution generated by 
d x h , d x po, d x pi, ■ ■ ■ , d x p n - 1 would be n + 1, i.e., larger than n. 


From (271 and (261 we obtain: 


i*-i 


Pj* — ^ ) Cjd x Pj T C— i d x h “t - Xj*d x P 

3=0 


(28) 


From equation (261 , for j = 0, • • • , j* — 1, we obtain: = p' — Xjd x p- By substituting in (28 ) 

we obtain: 


r -1 


f-i 


j* 


/ ^ brj 
J=0 


Cj/r' -c-id x h= - ^2 c jXj + Xj* d x 


(29) 


3=0 


We remark that the left hand side consists of the sum of covectors that belong to Since in 
general Xj* 7^ Xq^c) 1 c iXji we conclude that d x p £ ■ 

The previous lemma ensures that it exists a finite m such that d x p £ Q m . In particular, from 
the previous proof, it is possible to check that this value of m cannot exceed n + 2. The following 
theorem allows us to obtain the criterion to stop the algorithm in definition [3] 

Theorem 2 If d x p £ f l m and f l m is invariant under Cf and C_b^, then = fl m Vp > 0 

Proof: We proceed by induction. Obviously, the equality holds for p = 0. 

Inductive step: let us assume that S2 m+p = and let us prove that fl m+p+ i = f l m . We 
have to prove that d x £j, m+p h £ fl m . Indeed, from the inductive assumption, we know that 
H m+p (= fl m ) is invariant under Cf and C_a_. Additionally, because of this invariance, by using 

proposition [6] we obtain that fl m is also invariant under C r f lj , for j = 1,2, , m + p — 1. Since 


d x p £ fi m we have d x Cj,. 


0 


7 we also have dx 


+P 

C-d> -L~ O L 


_ 2 p £ f l m . Additionally, d x Cj >m+p _ 1 h £ Cl m and, because of lemma 
£ Cl m . Finally, because of the invariance under C a, also the 


Lie derivatives along A- of dxCj,^ _ x h and d x — m ~t p 1 2 —- belong to fl m . Now, we use equation 
(22 ) for j = m + p. By computing the gradient of this equation it is immediate to obtain that 


d x Cj, m+v h € I 

We conclude this section by providing an upper bound for the number of steps necessary 
to achieve the convergence. The dimension of Op +2 is at least the dimension of the span of 
the covectors: d x h, d x p! 0 , dxp'^, , d x p!j,_ l . From the definition of j*, we know that the 
vectors d x h , d x po, d x pi, , d x pj*-\ are independent meaning that the dimension of their 


span is j* + 1. Hence, from (261, it easily follows that the dimension of the span of the vectors 
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d x h , d x fj,' 0 , dxfi'h , d x fAj *_^, d x p is at least j* + 1. Since Q 7 * + 3 contains this span, its 
dimension is at least j* + 1. Therefore, the condition n m +i = fl m , for m > j* + 3 is achieved 
for to < n + 2. 


8 Extension to the case of multiple known inputs and method’s 
summary 

The previous two sections provide a complete answer to the problem of deriving all the observ¬ 
ability properties of a system whose dynamics is driven by a single known input and a single 
unknown input and that depend non-linearly on the state and linearly on both the inputs. Before 
providing the steps to be followed in order to obtain the weak local observability properties of 
such a system, we remark that it is possible to extend our results to the case of multiple known 
inputs. This extension is simply obtained by re-defining the ff codistribution. 

We are referring to the nonlinear system characterized by the following equations: 


{ m u 

X = fi(x)ui + g{x)w 
i= 1 
y = h(x) 

The new f 1 codistribution is defined as follows: 


(30) 


Definition 5 (Q codistribution, m w = 1, Vm u ) This codistribution is defined recursively by 
the following algorithm: 


1 . flo — d x h, 

2. = ffm-l + Ei=l + C-^.flm-1 + E™" C,^ _d x fl 

where the vectors (j) l m £ R" (* = !,■•• ,m u ) are defined by the following algorithm: 


i■ <t>h = fa 


S) JA _ [&m — 1 1 ff] 

Ym — L 1 


It is immediate to repeat all the steps carried out in section [6] and extend the validity of theorem 
[l] to the system characterized by ( |30| ) . This extension states that all the observability properties 


of the state that satisfies the nonlinear dynamics in (30) can be derived by analyzing the codis¬ 
tribution defined by definition [5] Finally, also theorem |2| can be easily extended to cope with the 
case of multiple known inputs. In this case, requiring that fl rra+ 1 = f l m means that Q m must be 
invariant with respect to and all Lf i simultaneously. 


We conclude this section by outlining the steps to investigate the weak local observability at 
a given point Xq of a nonlinear system driven by a single disturbance and several known inputs. 
In other words, to investigate the weak local observability of a system defined by a state that 
satisfies the dynamics in (301. The validity of the following procedure is a consequence of the 
theoretical results previously derived (in particular theorem [l] and theorem [2]) . This procedure 
is also available in 1361. 
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1. For the chosen Xo, compute L g (= C g h) and P (^— (ia )2 ) • In the case when L g = 0, 
introduce new local coordinates, as explained at the end of section [5] and re-define the 
output^ 

2. Build the codistribution fl m (at Xq) by using the algorithm provided in definition|5] starting 
from m = 0 and, for each to, check if d x p £ fl m . 


3. Denote by m! the smallest to such that d x p £ 

4. For each to > m! check if f2 m+1 = and denote by f2* = f2 m » where to* is the smallest 
integer such that to* > m! and D m . + i = f2 m » (note that m* < n + 2). 

5. If the gradient of a given state component (xj, j = 1, • • • ,ri) belongs to fl* (namely if 
d x Xj £ D*) on a given neighbourhood of Xq, then Xj is weakly locally observable at Xq. 
If this holds for all the state components, the state x is weakly locally observable at Xq- 
Finally, if the dimension of f i* is smaller than n on a given neighbourhood of Xq, then the 
state is not weakly locally observable at Xq. 


9 Applications 


We apply the theory developed in the previous sections in order to investigate the observability 
properties of several nonlinear systems driven by unknown inputs. In |9.1| we consider systems 


with a single disturbance, namely characterized by the equations given in (30). In this case we 
will use the results obtained in sections Him and [5] In particular, we will follow the steps 
outlined at the end of section [8] In |9.2| we consider the case of multiple disturbances, i.e., when 
the state dynamics satisfy the first equation in 0. In this section, we also consider the case of 
multiple outputs and we use directly the EORC , as discussed in section [4] 


9.1 Systems with a single disturbance 

We consider a vehicle that moves on a 2D-environment. The configuration of the vehicle in a 
global reference frame, can be characterized through the vector [x v , y v , 9 V ] T where x v and y v are 
the cartesian vehicle coordinates, and 9 V is the vehicle orientation. We assume that the dynamics 
of this vector satisfy the unicycle differential equations: 

x v = v cos 9 V 

y v = v sin 9 V (31) 

0 V = to 

where v and w are the linear and the rotational vehicle speed, respectively, and they are the system 
inputs. We consider the following three cases of output (see also figure [l] for an illustration): 

1. the distance from the origin (e.g., a landmark is at the origin and its distance is measured 
by a range sensor); 

2. the bearing of the origin in the local frame (e.g., a landmark is at the origin and its bearing 
angle is measured by an on-board camera); 

2 Note that in the case of multiple known inputs, for the local coordinates we have the possibility to choose 
among the m u functions fi. The most convenient choice is the one that corresponds to the highest relative degree 
(if this degree coincides with n it means that the state is weakly locally observable and we do not need to pursue 
the observability analysis). 
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Figure 1: The vehicle state in cartesian and polar coordinates together with the three considered 
outputs. 


3. the bearing of the vehicle in the global frame (e.g., a camera is placed at the origin). 

We can analytically express the output in terms of the state. We remark that the expressions 
become very simple if we adopt polar coordinates: r = + y%, (j> = atan^r. We have, for 

the three cases, y = r, y = tt — ( 9 V — (j >) and y = (f>, respectively. For each of these three cases, 
we consider the following two cases: v is known, to is unknown; v is unknown, w is known. The 
dynamics in these new coordinates become: 

r = v cos (9 V — (/)) 

<t>=- sin((9„ - </>) (32) 

_d v = oj 


9.1.1 y = r, u = w, w = v 



' 0 ' 


cos (0 V — (j)) 

In this case we have / = 

0 

and g = 

sin (0 v — (f>) 


1 


0 


We follow the five steps mentioned at the end of section |8] 
p = = tan . Additionally: 


d x p = 


tan {6 V — (j)) 


tan(0„ — (j)) 


cos 2 {6 V — 


We have L l g = cos (9 V 

2 

(j>) ’ cos 2 {9 V — (/))_ 


cf >) and 


We also have flo = span{[ 1, 0, 0]}. Hence, d x p ^ Ho- Additionally, Hi = Ho- We need to compute 

— tan {0 V — (j)) 

0,2 and, in order to do this, we need to compute </>i. We obtain: <pi = 


and 


0,2 


span 


{[ 1 , 0 , 0 ], 


5 cos 2 (6 v —(p) 


1 

cos 2 (0 v — <p) 


}■ 


It is immediate to check that d x p £ O 2 , 


meaning that m! = 2. Additionally, by a direct computation, it is possible to check that 0 3 = 0 2 
meaning that m* = 2 and O* =O 2 , whose dimension is 2. We conclude that the dimension of 
the observable space is equal to 2 and the state is not weakly locally observable. 
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9.1.2 y = r, u = v, w = w 


In this case we have / = 

cos(0„ — <j>) 

sin (0 v —<fr) 

and g = 

' 0 ' 
0 


0 


1 


We follow the five steps mentioned at the end of section pi We easily obtain L 1 = 0. Hence, 


we have to introduce new local coordinates, as explainedat the end of section [5j We obtain 
C l jh = cos [0 V — <j>) and we obtain that the relative degree of the associated system in (151 is 
r = 2. Let us denote the new coordinates by x \, x' 2 , x' 3 . In accordance with (161 and (fl7|) we 


should set x[ = r and x' 2 = cos (8 V — </>). On the other hand, to simplify the computation, we set 
x 2 = 0 V — <f>. Finally, we set x' 3 = 9 V . We compute the new vector fields that characterize the 
dynamics in the new coordinates. We have: 


cos(£ 2 ) 

f = ain(gg) 

^ x'i 

0 

Additionally, we set h = cos^) and Hi = span{\ 1, 0,0], [0, — sin^), 0]}. In the new coordinates 
we obtain: Lg = — sin(:r 2 ) and p = — • It is immediate to check that d x p £ Hi, meaning 

that m! = 1. Additionally, by a direct computation, it is possible to check that H 2 = Hi meaning 
that m* = 1 and fi* = Hi, whose dimension is 2. We conclude that the dimension of the 
observable space is equal to 2 and the state is not weakly locally observable. 


(33) 


9.1.3 y = 6 V — 4>, u = to 

, w = 

= V 



' 0 ' 


cos (9 V — </>) 

In this case we have / = 

0 

and g = 

sin {9 v — <p) 


1 


0 


We follow the five steps mentioned at the end of section I We have Lg = — Sln ^ ^ and 
p = 2 cot (0 V — <f>). Additionally: 


d x p 



T7 [°,!) -!] 

9) 


We also have H 0 = span{[ 0, —1,1]}. Hence, d x p £ H 0 , meaning that m' = 0. Additionally, by 
a direct computation, it is possible to check that Hi = Ho meaning that m* — 0 and H* = Ho, 
whose dimension is 1. We conclude that the dimension of the observable space is equal to 1 and 
the state is not weakly locally observable. 


9.1.4 y = () v - 


U = V, W = OJ 


cos(0„ — 4>) 


r 0 1 

sin (0 v —<t)) 
r 

and g = 

0 

0 


1 


In this case we have / = 


We follow the five steps mentioned at the end of section [8] We have L ] g = 1 and p = 0. 
Hence, d x p = [0,0,0] and we do not need to check if d x p £ fi m . In other words, we can set 
m' = 0. By a direct computation we obtain: Ho = span{[0, — 1,1]}, Hi = span {[0, — 1,1], 
cos(8v-ci>) ^ cos (8v-cp) | Additionally, we obtain H 2 = Hi, meaning that m* = 1 

and H* = Hi, whose dimension is 2. We conclude that the dimension of the observable space is 
equal to 2 and the state is not weakly locally observable. 
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9.1.5 y = cf>, u = uj, w = v 


In this case we have / = 

' 0 ' 
0 

and g = 

cos(0„ — (f>) 

sin (6 v — cf>) 


1 


0 


We follow the five steps mentioned at the end of section a We have Lg = Sln ( e " ^ anc i p = 
—2 cot (0„ — </>). Additionally: 


d x p = 



9) 


We also have S7 q = span{[0, 1,0]}. Hence, d x p ^ H 0 . 
compute H 2 and, in order to do this, we need to compute 


Additionally, Hi 
</>i. We obtain: cj. 


= Ho- We need to 
—r 

L = COt(0„ - 0) 


0 


and H 2 = span | [0,1, 0], sin2( -g [0,1, —1]|. It is immediate to check that d x p £ meaning 
that rnn' = 2. Additionally, by a direct computation, it is possible to check that fi 3 = fi 2 meaning 
that m* = 2 and fi* = SA 2 , whose dimension is 2. We conclude that the dimension of the 
observable space is equal to 2 and the state is not weakly locally observable. 


9.1.6 y = <j), u = v, w = ui 



cos(0„ — (j>) 


' 0 ' 

In this case we have / = 

sin(0 l , ~4>) 

and g = 

0 


0 


1 


We follow the five steps mentioned at the end of section pi We easily obtain L J = 0. Hence, 
we have to introduce new local coordinates, as explainedat the end of secti 

h = Sln ( e ^~^) an d we obtain that the relative degree of the associated system in (15 
Let us denote the new coordinates by x \, x 2 , x 3 . In accordance with (161 and ( |l 
and x' 2 = Sln ( e v- ( t > '> . Finally, we set x 3 = cos ^"~^) _ 

We compute the new vector fields that characterize the dynamics in the new coordinates. We 
obtain: 


We obtain 
is r = 2. 
weset x\ = <f> 


x 2 


0 

-2x' 2 x 3 

III 

*3 

f 2 2 



x 2 x 3 


1 

1 

H 

to 


(34) 


Additionally, we set h = x' 2 and Hi = span{[ 1,0, 0], [0,1,0]}. In the new coordinates we obtain: 
Lg = x' 3 and p = Since p depends on x 3 , d x p (f Hi. Since the dimension of is already 

2, because of lemma |9] we know that it exists a given integer m such that the dimension of fl m 
is larger than 2. Hence, we conclude that the entire state is weakly locally observable. 


9.2 Systems with multiple disturbances 

In this case we refer to the general case, i.e., to systems characterized by the dynamics given 
in (jT|) . For this general case we do not have the results stated by the theorem of separation 
(theorem [lj and we have to compute the entire codistribution and to proceed as it has been 
described in section [4] 

We derive the observability properties of two systems with unknown inputs. The first system 
characterizes a localization problem in the framework of mobile robotics. The state and its 
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dynamics are the same as in the example discussed in |9.1| However, we consider a different 
output and also the case when both the inputs are unknown. For this simple example, the use of 
our theory is not required to derive the observability properties, which can be obtained by using 
intuitive reasoning. 

The second system is much more complex and describes one of the most important sensor 
fusion problem, which is the problem of fusing visual and inertial measurements. We will refer 
to this problem as to the visual-inertial structure from motion problem (the Vi-SfM problem). 
This problem has been investigated by many disciplines, both in the framework of computer 
science 0 H3 HI 1201 EH EH] and in the framework of neuroscience (e.g., [5j [TO) !H]). Inertial 
sensors usually consist of three orthogonal accelerometers and three orthogonal gyroscopes. All 
together, they constitute the Inertial Measurement Unit (IMU). We will refer to the fusion of 
monocular vision with the measurements from an IMU as to the standard Vi-SfM problem. In 
[201 [21 [21 cm HH [501 [37] and [20] the observability properties of the standard Vi-SfM have been 
investigated in several different scenarios. Very recently, following two independent procedures, 
the most general result for the standard Vi-SfM problem has been provided in m and m- 
This result can be summarized as follows. In the standard Vi-SfM problem all the independent 
observable states are: the positions in the local frame of all the observed features, the three 
components of the speed in the local frame, the biases affecting the inertial measurements, the 
roll and the pitch angles, the magnitude of the gravity and the transformation between the camera 
and IMU frames. The fact that the yaw angle is not observable is an obvious consequence of 
the system invariance under rotation about the gravity vector. We want to use here the theory 
developed in the previous sections in order to investigate the observability properties of the Vi- 
SfM problem when the number of inertial sensors is reduced, i.e., when the system is driven by 
unknown inputs. 


9.2.1 Simple 2D localization problem 


We consider the system characterized by the same dynamics given in (31). Additionally, we 


assume that the vehicle is equipped with a GPS able to provide its position. Hence, the system 
output is the following two-components vector: 


V = [%v, VvY 


(35) 


Let us start by considering the case when both the system inputs, i.e., the two functions 
v(t) and w(f), are available. By comparing ([!]) with (311 we obtain x = [x v , y Vl 9 V ] T , m u = 2, 
m w = 0, ui = v, u 2 = u>, fo(x) = [0, 0, 0] T , fi(x) = [cos#„, sin6>„, 0] T and f 2 (x) = [0, 0, 1] T . 

In order to investigate the observability properties, we apply the observability rank crondition 
introduced in m- 

The system has two outputs: h x = x v and h y = y v . By definition, they coincide with their 
zero-order Lie derivatives. Their gradients with respect to the state are, respectively: [1, 0, 0] 
and [0, 1, 0]. Hence, the space spanned by the zero-order Lie derivatives has dimension two. 
Let us compute the first order Lie derivatives. We obtain: C\h x = cos 9 V , C\h y = sinff^, 
C\h x = C\h y = 0. Hence, the space spanned by the Lie derivatives up to the first order span 
the entire configuration space and we conclude that the state is weakly locally observable. 

We now consider th e case when both the system inputs are unknown. In this case, by 
comparing ([lj with (311 we obtain m u = 0, m w = 2, w\ = v, w 2 = w, fo{x) = [0, 0, 0] T , 
gi(x) = [cosfr„, sin#„, 0] T and g 2 {x) = [0, 0, 1] T . 

Intuitively, we know that the knowledge of both the inputs is unnecessary in order to have 
the full observability of the entire state. Indeed, the first two state components can be directly 
obtained from the GPS. By knowing these two components during a given time interval, we also 
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know their time derivatives. In particular, we know x v (0) and ^(0). From (311 we easily obtain: 
9 V (0) = atari (^rj^j ■ Hence, also the initial orientation is observable, by only using the GPS 


measurements. 

Let us proceed by applying the EORC, discussed in section [4] We start by computing the 
codistribution Hq in E^V We easily obtain: 


fl Q = span{[ 1, 0, 0], [0, 1, 0]} 

From this we know that x v and y v are weakly locally observable. We want to know if also 0 V is 
weakly locally observable (in which case the entire state would be weakly locally observable). We 
have to compute Hi in E*H. For, we build the system E*H. We have: 1 x = [x Vl y v , 9 V , v, uj] t . 
We can easily obtain the analytical expression for the quantities appearing in (|3]). We have: 
/q 1 \x) = [cos0 t ,u, sin(9„u, 0, 0] T . We compute the analytical expression of the first-order 
Lie derivatives along this vector filed. We have: C\h x = \7h x ■ x) = [1, 0, 0, 0, 0] ■ 

[cos 9 v v ) sin 9 v v, u>, 0, 0] = cos 9 v v (similarly, we obtain C\h y = sin0„u). We obtain: 

Hi = span{[ 1,0,0,0,0], [0,1,0,0,0], 

[0,0, — sin0„v, cos 9 V , 0], [0,0, cos 9 v v, sin 9 V , 0]} 

from which we obtain that the gradient of 9 V belongs to Hi. Therefore, also 9 V is weakly locally 
observable and so the entire original state. 


9.2.2 The Vi-SfM with partial input knowledge 

For the brevity sake, we do not provide here the computation necessary to deal with this problem. 
All the details are available in |331 [51] (see also the work in [25 for the definition of continuous 
symmetries). Here we provide a summary of these results. First of all, we remark that the 
Vi-SfM problem can be described by a nonlinear system with six inputs (3 are the accelerations 
along the three axes, provided by the accelerometers, and 3 are the angular speeds provided by 
the gyroscopes). The outputs are the ones provided by the vision. In the simplest case of a single 
point feature, they consist of the two bearing angles of this point feature in the camera frame. 
We analyzed the following three cases: 

1. camera extrinsically calibrated, only one input known (corresponding to the acceleration 
along a single axis); 

2. camera extrinsically uncalibrated, only one input known (corresponding to the acceleration 
along a single axis); 

3. camera extrinsically uncalibrated, two inputs known (corresponding to the acceleration 
along two orthogonal axes). 

The dimension of the original state is 12 in the first case and 23 in the other two cases. Ad¬ 
ditionally m u = 1 and m w = 5 in the first two cases while m u = 2 and m w = 4 in the last 
case. 

In [55) [55] we prove that the observability properties of Vi-SfM do not change by removing all 
the three gyroscopes and one of the accelerometers. In other words, exactly the same properties 
hold when the sensor system only consists of a monocular camera and two accelerometers. To 
achieve this result, we computed the Lie derivatives up to the second order for the third case 
mentioned above. By removing a further accelerometer (i.e., by considering the case of a monoc¬ 
ular camera and a single accelerometer) the system loses part of its observability properties. In 
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particular, the distribution A fc (= f2^), Vfc > 2, contains a single vector. This vector describes 
a continuous symmetry that is the invariance under the rotation around the accelerometer axis. 
This means that some of the internal parameters that define the extrinsic camera calibration, are 
no longer identifiable. Although this symmetry does not affect the observability of the absolute 
scale and the magnitude of the velocity, it reflects in an indistinguishability of all the initial 
speeds that differ for a rotation around the accelerometer axis. On the other hand, if the camera 
is extrinsically calibrated (i.e., if the relative transformation between the camera frame and the 
accelerometer frame is known (first case mentioned above)) this invariance disappears and the 
system still maintains full observability, as in the case of three orthogonal accelerometers and 
gyroscopes. The analysis of this system (the first case mentioned above) has been done in the 
extreme case when only a single point feature is available. This required to significantly augment 
the original state. In particular, in E3IM we compute all the Lie derivatives up to the 7 th order, 
i.e., we included in the original state the 5 unknown inputs together with their time-derivatives 
up to the six order. We prove that the gradient of any component of the original state, with the 
exception of the yaw angle, is orthogonal to the distribution A k , Vfc > 7 (see the computational 
details in [331135) 0 


3 Note that, the yaw angle is not observable even in the case when all the 6 inputs are known. The fact that 
the yaw is unobservable is a consequence of a symmetry in the considered system, which is the system invariance 
under rotations about the gravity axis. 
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10 Conclusion 


In this paper we investigated the problem of nonlinear observability when part (or even all) of 
the system inputs is unknown. We made the assumption that the unknown inputs are differen¬ 
tiable functions of time (up to a given order). The goal was not to design new observers but to 
provide simple analytic conditions in order to check the weak local observability of the state. An 
unknown input was also called disturbance. The analysis started by extending the observability 
rank condition. This was obtained by a state augmentation and was called the extended observ¬ 
ability rank condition. In general, by further augmenting the state, the observability properties 
of the original state also increase. As a result, the extended observability rank condition only 
provides a sufficient condition for the weak local observability of the original state since the state 
augmentation can be continued indefinitely. Additionally, the computational cost demanded to 
obtain the observability properties through the extended observability rank condition, dramat¬ 
ically depends on the dimension of the augmented state. For these reasons, we focused our 
investigation on the following two fundamental issues. The former consisted in understanding if 
there exists a given augmented state such that, by further augmenting the state, the observabil¬ 
ity properties of the original state provided by the extended observability rank condition remain 
unvaried. The latter consisted in understanding if it is possible to derive the observability prop¬ 
erties of the original state by computing a codistribution defined in the original space, namely a 
codistribution consisting of covectors with the same dimension of the original state. Both these 
issues have been fully addressed in the case of a single unknown input. In this case, we provided 
an analytical method to operate a separation on the codistribution computed by the extended 
observability rank condition, i.e., the codistribution defined in the augmented space. Thanks to 
this separation, we introduced a method able to obtain the observability properties by simply 
computing a codistribution that is defined in the original space (theorem [TJ. The new codistri¬ 
bution is defined recursively by a very simple algorithm. Specifically, the algorithm in definition 
[3] in section [6] (for the case of a single known input) and in definition [5] in section [8] (for the case 
of multiple known inputs). Hence, the overall method to obtain all the observability properties 
is very simple. On the other hand, the analytic derivations required to prove the validity of this 
separation are complex and we are currently extending them to the multiple unknown inputs 
case. Finally, we showed that the recursive algorithm converges in a finite number of steps and 
we also provided the criterion to establish if the algorithm has converged (theorem [2| . Also this 
proof is based on several tricky and complex analytical steps. 

Both theorems [l] and [ 2 ] have first been proved in the case of a single known input (sections [6] 
and [7]) but in section [8] their validity was extended to the case of multiple known inputs. 

All the theoretical results have been illustrated by deriving the observability properties of 
several nonlinear systems driven by known and unknown inputs. 


26 


References 


[1] J.-P. Barbot, M. Fliess, T. Floquet, "An algebraic framework for the design of nonlinear 
observers with unknown inputs", IEEE Conf. on Decision and Control, New-Orleans, USA, 
2007 

[2] J.-P. Barbot, D. Boutat, T. Floquet ,"An observation algorithm for nonlinear systems with 
unknown inputs", Automatica, Volume 45, Issue 8, August 2009, Pages 1970-1974 

[3] G. Basile and G. Marro. On the observability of linear, time invariant systems with unknown 
inputs. J. Optimization Theory Appl., 3:410-415, 1969. 

[4] F. A. W. Belo, P. Salaris, and A. Bicchi, 3 Known Landmarks are Enough for Solving Planar 
Bearing SLAM and Fully Reconstruct Unknown Inputs, The 2010 IEEE/RSJ International 
Conference on Intelligent Robots and Systems October 18-22, 2010, Taipei, Taiwan 

[5] A. Berthoz, B. Pavard and L.R. Young, Perception of Linear Horizontal Self-Motion Induced 
by Peripheral Vision Basic Characteristics and Visual-Vestibular Interactions, Exp. Brain 
Res. 23, 471-489 (1975). 

[6] S.P. Bhattacharyya Observer design for linear systems with unknown inputs IEEE Transac¬ 
tions on Automatic Control, 23 (1978) 

[7] M. Bryson and S. Sukkarieh, Observability Analysis and Active Control for Airbone SLAM, 
IEEE Transaction on Aerospace and Electronic Systems, vol. 44, no. 1, 261-280, 2008 

[8] Chen, W. and Saif, M. (2006). Unknown input observer design for a class of nonlinear systems: 
an LMI approach. In IEEE American Control Conference. 

[9] Darouach, M., Zasadzinski, M., Xu, S. J. (1994). Full-order observers for linear systems with 
unknown inputs. IEEE Transactions on Automatic Control, 39(3) 

[10] Dokka K., MacNeilage P. R., De Angelis G. C. and Angelaki D. E., Estimating distance 
during self-motion: a role for visual-vestibular interactions, Journal of Vision (2011) 11(13) :2, 
1-16 

[11] C. R. Fetsch, G. C. DeAngelis and D. E. Angelaki, Visual-vestibular cue integration for 
heading perception: Applications of optimal cue integration theory, Eur J Neurosci. 2010 
May ; 31(10): 1721-1729 

[12] Floquet, T., Barbot, J. (2004). A sliding mode approach of unknown input observers for 
linear systems. In 43rd IEEE conference on decision and control. Atlantis, Paradise Island, 
Bahamas. December. 

[13] Floquet, T., Edwards, C., Spurgeon, S. (2007). On sliding mode observers for systems with 
unknown inputs. Int. J. Adapt. Control Signal Process Vol 21, no. 89, 638-656, 2007. 

[14] Guan, Y., Saif, M. (1991). A novel approach to the design of unknown input observers. 
IEEE Transactions on Automatic Control, 36(5). 

[15] R. Guidorzi and G. Marro. On Wonham stabilizability condition in the synthesis of observers 
for unknown-input systems. Automatic Control, IEEE Transactions on, 16(5):499-500, oct 
1971. 


27 



[16] Chao X. Guo and Stergios I. Roumeliotis, IMU-RGBD Camera 3D Pose Estimation and 
Extrinsic Calibration: Observability Analysis and Consistency Improvement, ICRA 2013, 
Karlsruhe, Germany 

[17] Q.P. Ha, H. Trinh State and input simultaneous estimation for a class of nonlinear systems 
Automatica, 40 (2004). 

[18] H. Hammouri and Z. Tmar. Unknown input observer for state affine systems: A necessary 
and sufficient condition. Automatica, 46(2):271-278, 2010. 

[19] Hermann R. and Krener A.J., 1977, Nonlinear Controllability and Observability, Transaction 
On Automatic Control, AC-22(5): 728-740 

[20] J. A. Hesch, D. G. Kottas, S. L. Bowman, and S. I. Roumeliotis, Towards consistent vision- 
aided inertial navigation, Int. Workshop on the Algorithmic Foundations of Robotics, Cam¬ 
bridge, MA, Jun, 2012. 

[21] M. Hou, P.C. MAjjller Design of observers for linear systems with unknown inputs IEEE 
Transactions on Automatic Control, 37 (6) (1992). 

[22] Isidori A., Nonlinear Control Systems, 3rd ed., Springer Verlag, 1995. 

[23] E. Jones and S. Soatto, "Visual-inertial navigation, mapping and localization: A scalable 
real-time causal approach", The International Journal of Robotics Research, vol. 30, no. 4, 
pp. 407-430, Apr. 2011. 

[24] J. Kelly and G. Sukhatme, Visual-inertial simultaneous localization, mapping and sensor- 
to-sensor self-calibration, Int. Journal of Robotics Research, vol. 30, no. 1, pp. 56-79, 2011. 

[25] Koenig, D. and Mammar, S. (2001). Design of a class of reduced order unknown inputs 
nonlinear observer for fault diagnosis. In IEEE American Control Conference. 

[26] D. Koening, B. Marx, D. Jacquet Unknown input observers for switched nonlinear discrete 
time descriptor system IEEE Transactions on Automatic Control, 53 (1) (2008). 

[27] D. G. Kottas, J. A. Hesch, S. L. Bowman, and S. I. Roumeliotis, On the consistency of 
vision-aided inertial navigation, in Proc. of the Int. Symposium on Experimental Robotics, 
Canada, Jun 2012. 

[28] M. Li and A. I. Mourikis, Improving the accuracy of EKF-based visualinertial odometry, 
ICRA 2012, St. Paul, MN. 

[29] A. Martinelli, State Estimation Based on the Concept of Continuous Symmetry and Ob¬ 
servability Analysis: the Case of Calibration, IEEE Transactions on Robotics, vol. 27, no. 2, 
pp. 239-255, 2011 

[30] A. Martinelli, Vision and IMU data fusion: closed-form solutions for attitude, speed, abso¬ 
lute scale and bias determination, IEEE Transactions on Robotics, Volume 28 (2012), Issue 
1 (February), pp 44-60. 

[31] A. Martinelli, Visual-inertial structure from motion: observability and resolvability, iros 
2013, Tokyo, Japan 

[32] A. Martinelli, Closed-form solution of visual-inertial structure from motion, Int. J. of Com¬ 
puter Vision, Vol. 106, N. 2, 138-152, 2014. 


28 



[33] A. Martinelli, Visual-inertial structure from motion: observability vs minimum number of 
sensors, ICRA 2014, Hong Kong. 

[34] A. Martinelli, Observability Properties and Deterministic Algorithms in Visual- 
Inertial Structure from Motion, Foundations and Trends in Robotics, 139-209, 
http://dx.doi.org/10.1561/2300000030, 2014 

[35] A. Martinelli. Extension of the Observability Rank Condition to Nonlinear Systems Driven 
by Unknown Inputs. 23th Mediterranean Conference on Control and Automation (MED), 
June, 2015. Torremolinos, Spain 

[36] A. Martinelli. Nonlinear Unknown Input Observability: Analytical Expression of the Ob¬ 
servable Codistribution in the Case of a Single Unknown Input. The SIAM Conference on 
Control and Its Applications (CT15). July, 2015. Paris, France 

[37] Mirzaei F.M. and Roumeliotis S.I., A Kalman filter-based algorithm for IMU-camera cali¬ 
bration: Observability analysis and performance evaluation, Transactions on Robotics, 2008, 
Vol. 24, No. 5, 1143-1156 

[38] D. Strelow and S. Singh, Motion estimation from image and inertial measurements, Inter¬ 
national Journal of Robotics Research, 23(12), 2004 

[39] S.H. Wang, E.J. Davison, P. Dorato Observing the states of systems with unmeasurable 
disturbance IEEE Transactions on Automatic Control, 20 (1975) 

[40] Weiss., S., Vision Based Navigation for Micro Helicopters, PhD thesis, Diss. ETH No. 20305 

[41] F. Yang, R.W. Wilde Observer for linear systems with unknown inputs IEEE Transactions 
on Automatic Control, 33 (7) (1988). 


29 



